SEQUENCE PROTOCOL 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Prof. Dr. Werner Lubitz 

(B) STREET: Schoenborngasse 12/7 

(C) CITY: Vienna 

(E) COUNTRY: Austria 

(F) POSTAL CODE: 1080 

(ii) TITLE OF INVENTION: New systems for the regulation of gene expression 

(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER-READABLE FORM: 

(A) DATA CARRIER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(vi) INITIAL ORIGIN: 

(A) ORGANISM: lambda OR operator (wild-type) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



ACGTTAAATC TATCACCGCA AGGGATAAAT ATCTAACACC GTGCGTGTTG ACTATTTTAC 60 
CTCTGGCGGT GATAATGGTT GC 82 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: lambda OR operator (mutant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ACGTTAAATC TATCACCGCA AGGGATAAAT ATCTAACACC GCGCGTGTTG ACTATTTTAC 60 
CTCTGGCGGT GATAATGGTT GC 82 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: lambda OL operator (wild-type) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ACATACAGAT AACCATCTGC GGTGATAAAT TATCTCTGGC GGTGTTGACA TAAATACCAC 60 
TGGCGGTGAT ACTGAGCACA TCAGC 85 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: double strand 

(D) TOPOLOGY: both 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: pAW12 fragment 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement (106..816) 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1144..1416 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



ATTTACTATG TTATGTTCTG AGGGGAGTGA AAATTCCCCT AATTCGATGA AGATTCTTGC 60 
TCAATTGTTA TCAGCTATGC GCCGACCAGA ACACCTTGCC GATCAGCCAA ACGTCTCTTC 120 
AGGCCACTGA CTAGCGATAA CTTTCCCCAC AACGGAACAA CTCTCATTGC ATGGGATCAT 180 
TGGGTACTGT GGGTTTAGTG GTTGTAAAAA CACCTGACCG CTATCCCTGA TCAGTTTCTT 240 
GAAGGTAAAC TCATCACCCC CAAGTCTGGC TATGCAGAAA TCACCTGGCT CAACAGCCTG 300 
CTCAGGGTCA ACGAGAATTA ACATTCCGTC AGGAAAGCTT GGCTTGGAGC CTGTTGGTGC 360 
GGTCATGGAA TTACCTTCAA CCTCAAGCCA GAATGCAGAA TCACTGGCTT TTTTGGTTGT 420 
GCTTACCCAT CTCTCCGCAT CACCTTTGGT AAAGGTTCTA AGCTTAGGTG AGAACATCCC 480 
TGCCTGAACA TGAGAAAAAA CAGGGTACTC ATACTCACTT CTAAGTGACG GCTGCATACT 540 
AACCGCTTCA TACATCTCGT AGATTTCTCT GGCGATTGAA GGGCTAAATT CTTCAACGCT 600 
AACTTTGAGA ATTTTTGTAA GCAATGCGGC GTTATAAGCA TTTAATGCAT TGATGCCATT 660 
AAATAAAGCA CCAACGCCTG ACTGCCCCAT CCCCATCTTG TCTGCGACAG ATTCCTGGGA 720 
TAAGCCAAGT TCATTTTTCT TTTTTTCATA AATTGCTTTA AGGCGACGTG CGTCCTCAAG 780 
CTGCTCTTGT GTTAATGGTT TCTTTTTTGT GCTCATACGT TAAATCTATC ACCGCAAGGG 840 
ATAAATATCT AACACCGCGC GTGTTGACTA TTTTACCTCT GGCGGTGATA ATGGTTGCAT 900 
GTACTAAGTA GGTTGTATGG AACAACGCAT AACCCTGAAA GATTATGCAA TGCGCTTTGG 960 
GCAAACCAAG ACAGCTAAAG ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTATCGAA 1020 
TTCTCATTCA GGCTTCTGCC GTTTTGGATT TAACCGAAGA TGATTTCGAT TTTCTGACGA 1080 
GTAACAAAGT TTGGATTGCT ACTGACCGCT CTCGTGCTCG TCGCTGCGTT GAGGCTTGCG 1140 

TTT ATG GTA CGC TGG ACT TTG TGG GAT ACC CTC GCT TTC CTG CTC CTG 1188 
    Met Val Arg Trp Thr Leu Trp Asp Thr Leu Ala Phe Leu Leu Leu 
      1               5                  10                  15 

TTG AGT TTA TTG CTG CCG TCA TTG CTT ATT ATG TTC ATC CCG TCA ACA 1236 
Leu Ser Leu Leu Leu Pro Ser Leu Leu Ile Met Phe Ile Pro Ser Thr 
                 20                  25                  30 

TTC AAA CGG CCT GTC TCA TCA TGG AAG GCG CTG AAT TTA CGG AAA ACA 1284 
Phe Lys Arg Pro Val Ser Ser Trp Lys Ala Leu Asn Leu Arg Lys Thr 
             35                  40                  45 

TTA TTA ATG GCG TCG AGC GTC CGG TTA AAG CCG CTG AAT TGT TCG CGT 1332 
Leu Leu Met Ala Ser Ser Val Arg Leu Lys Pro Leu Asn Cys Ser Arg 
         50                  55                  60 

TTA CCT TGC GTG TAC GCG CAG GAA ACA CTG ACG TTC TTA CTG ACG CAG 1380 
Leu Pro Cys Val Tyr Ala Gln Glu Thr Leu Thr Phe Leu Leu Thr Gln 
     65                  70                  75 

AAG AAA ACG TGC GTC AAA AAT TAC GTG CAG AAG GAG TGATGTAATG 1426 
Lys Lys Thr Cys Val Lys Asn Tyr Val Gln Lys Glu 
 80                  85                  90 

TCTAAAGGTA AAAAACGTTC TGGCGCTCGC CCTGGTCGTC CGCAGCCGTT GCGAGGTACT 1486 
AAAGGCAAGC GTAAAGGCGC TCGTCTTTGG TATGTAGGTG GTCAACAATT TTAATTGCAG 1546 
GGGCTTCGGC CCTTACTTGA GGATAAATTA TGTCTAATAT TCAAACTGGC GCCGA 1601 




(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met Ser Thr Lys Lys Lys Pro Leu Thr Gln Glu Gln Leu Glu Asp Ala 
  1               5                  10                  15 

Arg Arg Leu Lys Ala Ile Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu 
             20                  25                  30 

Ser Gln Glu Ser Val Ala Asp Lys Met Gly Met Gly Gln Ser Gly Val 
         35                  40                  45 

Gly Ala Leu Phe Asn Gly Ile Asn Ala Leu Asn Ala Tyr Asn Ala Ala 
     50                  55                  60 

Leu Leu Thr Lys Ile Leu Lys Val Ser Val Glu Glu Phe Ser Pro Ser 
 65                  70                  75                  80 

Ile Ala Arg Glu Ile Tyr Glu Met Tyr Glu Ala Val Ser Met Gln Pro 
                 85                  90                  95 

Ser Leu Arg Ser Glu Tyr Glu Tyr Pro Val Phe Ser His Val Gln Ala 
            100                 105                 110 

Gly Met Phe Ser Pro Lys Leu Arg Thr Phe Thr Lys Gly Asp Ala Glu 
        115                 120                 125 

Arg Trp Val Ser Thr Thr Lys Lys Ala Ser Asp Ser Ala Phe Trp Leu 
    130                 135                 140 

Glu Val Glu Gly Asn Ser Met Thr Ala Pro Thr Gly Ser Lys Pro Ser 
145                 150                 155                 160 

Phe Pro Asp Gly Met Leu Ile Leu Val Asp Pro Glu Gln Ala Val Glu 
                165                 170                 175 

Pro Gly Asp Phe Cys Ile Ala Arg Leu Gly Gly Asp Glu Phe Thr Phe 
            180                 185                 190 

Lys Lys Leu Ile Arg Asp Ser Gly Gln Val Phe Leu Gln Pro Leu Asn 
        195                 200                 205 

Pro Gln Tyr Pro Met Ile Pro Cys Asn Glu Ser Cys Ser Val Val Gly 
    210                 215                 220 

Lys Val Ile Ala Ser Gln Trp Pro Glu Glu Thr Phe Gly 
225                 230                 235 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Val Arg Trp Thr Leu Trp Asp Thr Leu Ala Phe Leu Leu Leu Leu 
  1               5                  10                  15 

Ser Leu Leu Leu Pro Ser Leu Leu Ile Met Phe Ile Pro Ser Thr Phe 
             20                  25                  30 

Lys Arg Pro Val Ser Ser Trp Lys Ala Leu Asn Leu Arg Lys Thr Leu 
         35                  40                  45 

Leu Met Ala Ser Ser Val Arg Leu Lys Pro Leu Asn Cys Ser Arg Leu 
     50                  55                  60 

Pro Cys Val Tyr Ala Gln Glu Thr Leu Thr Phe Leu Leu Thr Gln Lys 
 65                  70                  75                  80 

Lys Thr Cys Val Lys Asn Tyr Val Gln Lys Glu 
                 85                  90 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2834 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: double strand 

(D) TOPOLOGY: both 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: pCS J fragment 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement (106..816) 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1025..2104 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2377..2649 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATTTACTATG TTATGTTCTG AGGGGAGTGA AAATTCCCCT AATTCGATGA AGATTCTTGC 60 
TCAATTGTTA TCAGCTATGC GCCGACCAGA ACACCTTGCC GATCAGCCAA ACGTCTCTTC 120 
AGGCCACTGA CTAGCGATAA CTTTCCCCAC AACGGAACAA CTCTCATTGC ATGGGATCAT 180 
TGGGTACTGT GGGTTTAGTG GTTGTAAAAA CACCTGACCG CTATCCCTGA TCAGTTTCTT 240 
GAAGGTAAAC TCATCACCCC CAAGTCTGGC TATGCAGAAA TCACCTGGCT CAACAGCCTG 300 
CTCAGGGTCA ACGAGAATTA ACATTCCGTC AGGAAAGCTT GGCTTGGAGC CTGTTGGTGC 360 
GGTCATGGAA TTACCTTCAA CCTCAAGCCA GAATGCAGAA TCACTGGCTT TTTTGGTTGT 420 
GCTTACCCAT CTCTCCGCAT CACCTTTGGT AAAGGTTCTA AGCTTAGGTG AGAACATCCC 480 
TGCCTGAACA TGAGAAAAAA CAGGGTACTC ATACTCACTT CTAAGTGACG GCTGCATACT 540 
AACCGCTTCA TACATCTCGT AGATTTCTCT GGCGATTGAA GGGCTAAATT CTTCAACGCT 600 
AACTTTGAGA ATTTTTGTAA GCAATGCGGC GTTATAAGCA TTTAATGCAT TGATGCCATT 660 
AAATAAAGCA CCAACGCCTG ACTGCCCCAT CCCCATCTTG TCTGCGACAG ATTCCTGGGA 720 
TAAGCCAAGT TCATTTTTCT TTTTTTCATA AATTGCTTTA AGGCGACGTG CGTCCTCAAG 780 
CTGCTCTTGT GTTAATGGTT TCTTTTTTGT GCTCATACGT TAAATCTATC ACCGCAAGGG 840 
ATAAATATCT AACACCGCGC GTGTTGACTA TTTTACCTCT GGCGGTGATA ATGGTTGCAT 900 
GTACTAAGTA GGTTGTATGG AACAACGCAT AACCCTGAAA GATTATGCAA TGCGCTTTGG 960 
GCAAACCAAG ACAGCTAAAG ATCCTCTAGA GCGCCCGGAA GAGAGTCAAT TCAGGGTGGT 1020 

GAAT GTG AAA CCA GTA ACG TTA TAC GAT GTC GCA GAG TAT GCC GGT GTC 1069 
     Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala Gly Val 
                  95                 100                 105 

TCT TAT CAG ACC GTT TCC CGC GTG GTG AAC CAG GCC AGC CAC GTT TCT 1117 
Ser Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His Val Ser 
            110                 115                 120 

GCG AAA ACG CGG GAA AAA GTG GAA GCG GCG ATG GCG GAG CTG AAT TAC 1165 
Ala Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu Asn Tyr 
        125                 130                 135 

ATT CCC AAC CGC GTG GCA CAA CAA CTG GCG GGC AAA CAG TCG TTG CTG 1213 
Ile Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser Leu Leu 
    140                 145                 150 

ATT GGC GTT GCC ACC TCC AGT CTG GCC CTG CAC GCG CCG TCG CAA ATT 1261 
Ile Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser Gln Ile 
155                 160                 165                 170 

GTC GCG GCG ATT AAA TCT CGC GCC GAT CAA CTG GGT GCC AGC GTG GTG 1309 
Val Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser Val Val 
                175                 180                 185 

GTG TCG ATG GTA GAA CGA AGC GGC GTC GAA GCC TGT AAA GCG GCG GTG 1357 
Val Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala Ala Val 
            190                 195                 200 

CAC AAT CTT CTC GCG CAA CGC GTC AGT GGG CTG ATC ATT AAC TAT CCG 1405 
His Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn Tyr Pro 
        205                 210                 215 

CTG GAT GAC CAG GAT GCC ATT GCT GTG GAA GCT GCC TGC ACT AAT GTT 1453 
Leu Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr Asn Val 
    220                 225                 230 

CCG GCG TTA TTT CTT GAT GTC TCT GAC CAG ACA CCC ATC AAC AGT ATT 1501 
Pro Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn Ser Ile 
235                 240                 245                 250 

ATT TTC TCC CAT GAA GAC GGT ACG CGA CTG GGC GTG GAG CAT CTG GTC 1549 
Ile Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His Leu Val 
                255                 260                 265 

GCA TTG GGT CAC CAG CAA ATC GCG CTG TTA GCG GGC CCA TTA AGT TCT 1597 
Ala Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu Ser Ser 
            270                 275                 280 

GTC TCG GCG CGT CTG CGT CTG GCT GGC TGG CAT AAA TAT CTC ACT CGC 1645 
Val Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg 
        285                 290                 295 

AAT CAA ATT CAG CCG ATA GCG GAA CGG GAA GGC GAC TGG AGT GCC ATG 1693 
Asn Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser Ala Met 
    300                 305                 310 

TCC GGT TTT CAA CAA ACC ATG CAA ATG CTG AAT GAG GGC ATC GTT CCC 1741 
Ser Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile Val Pro 
315                 320                 325                 330 

ACT GCG ATG CTG GTT GCC AAC GAT CAG ATG GCG CTG GGC GCA ATG CGC 1789 
Thr Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala Met Arg 
                335                 340                 345 

GCC ATT ACC GAG TCC GGG CTG CGC GTT GGT GCG GAT ATC TCG GTA GTG 1837 
Ala Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser Val Val 
            350                 355                 360 

GGA TAC GAC GAT ACC GAA GAC AGC TCA TGT TAT ATC CCG CCG TCA ACC 1885 
Gly Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro Ser Thr 
        365                 370                 375 

ACC ATC AAA CAG GAT TTT CGC CTG CTG GGG CAA ACC AGC GTG GAC CGC 1933 
Thr Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val Asp Arg 
    380                 385                 390 

TTG CTG CAA CTC TCT CAG GGC CAG GCG GTG AAG GGC AAT CAG CTG TTG 1981 
Leu Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln Leu Leu 
395                 400                 405                 410 

CCC GTC TCA CTG GTG AAA AGA AAA ACC ACC CTG GCG CCC AAT ACG CAA 2029 
Pro Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gln 
                415                 420                 425 

ACC GCC TCT CCC CGC GCG TTG GCC GAT TCA TTA ATG CAG CTG GCA CGA 2077 
Thr Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu Ala Arg 
            430                 435                 440 

CAG GTT TCC CGA CTG GAA AGC GGG CAG TGAGCGCAAC GCAATTAATG 2124 
Gln Val Ser Arg Leu Glu Ser Gly Gln 
        445                 450 

TGAGTTAGCT CACTCACTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 2184 
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTCTGCA GGCATGCAAG 2244 
CTTATCGAAT TCTCATTCAG GCTTCTGCCG TTTTGGATTT AACCGAAGAT GATTTCGATT 2304 
TTCTGACGAG TAACAAAGTT TGGATTGCTA CTGACCGCTC TCGTGCTCGT CGCTGCGTTG 2364 

AGGCTTGCGT TT ATG GTA CGC TGG ACT TTG TGG GAT ACC CTC GCT TTC 2412 
              Met Val Arg Trp Thr Leu Trp Asp Thr Leu Ala Phe 
                1               5                  10 

CTG CTC CTG TTG AGT TTA TTG CTG CCG TCA TTG CTT ATT ATG TTC ATC 2460 
Leu Leu Leu Leu Ser Leu Leu Leu Pro Ser Leu Leu Ile Met Phe Ile 
         15                  20                  25 

CCG TCA ACA TTC AAA CGG CCT GTC TCA TCA TGG AAG GCG CTG AAT TTA 2508 
Pro Ser Thr Phe Lys Arg Pro Val Ser Ser Trp Lys Ala Leu Asn Leu 
     30                  35                  40 

CGG AAA ACA TTA TTA ATG GCG TCG AGC GTC CGG TTA AAG CCG CTG AAT 2556 
Arg Lys Thr Leu Leu Met Ala Ser Ser Val Arg Leu Lys Pro Leu Asn 
 45                  50                  55                  60 

TGT TCG CGT TTA CCT TGC GTG TAC GCG CAG GAA ACA CTG ACG TTC TTA 2604 
Cys Ser Arg Leu Pro Cys Val Tyr Ala Gln Glu Thr Leu Thr Phe Leu 
                 65                  70                  75 

CTG ACG CAG AAG AAA ACG TGC GTC AAA AAT TAC GTG CAG AAG GAG 2649 
Leu Thr Gln Lys Lys Thr Cys Val Lys Asn Tyr Val Gln Lys Glu 
             80                  85                  90 

TGATGTAATG TCTAAAGGTA AAAAACGTTC TGGCGCTCGC CCTGGTCGTC CGCAGCCGTT 2709 
GCGAGGTACT AAAGGCAAGC GTAAAGGCGC TCGTCTTTGG TATGTAGGTG GTCAACAATT 2769 
TTAATTGCAG GGGCTTCGGC CCTTACTTGA GGATAAATTA TGTCTAATAT TCAAACTGGC 2829 
GCCGA 2834 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ser Thr Lys Lys Lys Pro Leu Thr Gln Glu Gln Leu Glu Asp Ala 
  1               5                  10                  15 

Arg Arg Leu Lys Ala Ile Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu 
             20                  25                  30 

Ser Gln Glu Ser Val Ala Asp Lys Met Gly Met Gly Gln Ser Gly Val 
         35                  40                  45 

Gly Ala Leu Phe Asn Gly Ile Asn Ala Leu Asn Ala Tyr Asn Ala Ala 
     50                  55                  60 

Leu Leu Thr Lys Ile Leu Lys Val Ser Val Glu Glu Phe Ser Pro Ser 
 65                  70                  75                  80 

Ile Ala Arg Glu Ile Tyr Glu Met Tyr Glu Ala Val Ser Met Gln Pro 
                 85                  90                  95 

Ser Leu Arg Ser Glu Tyr Glu Tyr Pro Val Phe Ser His Val Gln Ala 
            100                 105                 110 

Gly Met Phe Ser Pro Lys Leu Arg Thr Phe Thr Lys Gly Asp Ala Glu 
        115                 120                 125 

Arg Trp Val Ser Thr Thr Lys Lys Ala Ser Asp Ser Ala Phe Trp Leu 
    130                 135                 140 

Glu Val Glu Gly Asn Ser Met Thr Ala Pro Thr Gly Ser Lys Pro Ser 
145                 150                 155                 160 

Phe Pro Asp Gly Met Leu Ile Leu Val Asp Pro Glu Gln Ala Val Glu 
                165                 170                 175 

Pro Gly Asp Phe Cys Ile Ala Arg Leu Gly Gly Asp Glu Phe Thr Phe 
            180                 185                 190 

Lys Lys Leu Ile Arg Asp Ser Gly Gln Val Phe Leu Gln Pro Leu Asn 
        195                 200                 205 

Pro Gln Tyr Pro Met Ile Pro Cys Asn Glu Ser Cys Ser Val Val Gly 
    210                 215                 220 

Lys Val Ile Ala Ser Gln Trp Pro Glu Glu Thr Phe Gly 
225                 230                 235 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala Gly Val Ser 
  1               5                  10                  15 

Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His Val Ser Ala 
             20                  25                  30 

Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu Asn Tyr Ile 
         35                  40                  45 

Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser Leu Leu Ile 
     50                  55                  60 

Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser Gln Ile Val 
 65                  70                  75                  80 

Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser Val Val Val 
                 85                  90                  95 

Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala Ala Val His 
            100                 105                 110 

Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn Tyr Pro Leu 
        115                 120                 125 

Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr Asn Val Pro 
    130                 135                 140 

Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn Ser Ile Ile 
145                 150                 155                 160 

Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His Leu Val Ala 
                165                 170                 175 

Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu Ser Ser Val 
            180                 185                 190 

Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg Asn 
        195                 200                 205 

Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser Ala Met Ser 
    210                 215                 220 

Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile Val Pro Thr 
225                 230                 235                 240 

Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala Met Arg Ala 
                245                 250                 255 

Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser Val Val Gly 
            260                 265                 270 

Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro Ser Thr Thr 
        275                 280                 285 

Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val Asp Arg Leu 
    290                 295                 300 

Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln Leu Leu Pro 
305                 310                 315                 320 

Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gln Thr 
                325                 330                 335 

Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu Ala Arg Gln 
            340                 345                 350 

Val Ser Arg Leu Glu Ser Gly Gln 
        355                 360 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Val Arg Trp Thr Leu Trp Asp Thr Leu Ala Phe Leu Leu Leu Leu 
  1               5                  10                  15 

Ser Leu Leu Leu Pro Ser Leu Leu Ile Met Phe Ile Pro Ser Thr Phe 
             20                  25                  30 

Lys Arg Pro Val Ser Ser Trp Lys Ala Leu Asn Leu Arg Lys Thr Leu 
         35                  40                  45 

Leu Met Ala Ser Ser Val Arg Leu Lys Pro Leu Asn Cys Ser Arg Leu 
     50                  55                  60 

Pro Cys Val Tyr Ala Gln Glu Thr Leu Thr Phe Leu Leu Thr Gln Lys 
 65                  70                  75                  80 

Lys Thr Cys Val Lys Asn Tyr Val Gln Lys Glu 
                 85                  90 



